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x ^Pl.ANT LONG CHAIN .rm RTQSYNTHETICENZY^ 

FIELD OF THE INVENTION 

The invention relates to a condensing enzyme involved in long chain fatty aad 
production in plants, including related nucleic acid sequences. 

BACKGROUND 

Living organisms synthesize a vast array of different fatty acids, winch are mcorporated 
into complex lipids. These complex lipids represent both major structural membrane 

chain fattv acids (VLCFAs, chain length C20 or longer) are synthesized predonunantly m the 
epidermal cells where they are either directly incorporated into waxes, or serve as precursors 
for other aliphatic hydrocarbons found in waxes, including alkanes, primary and secondary 
alcohols, ketones, aldehydes and acyl-esters (for review see Post-Beittenmiller, 1996). 
VLCFAs also accumulate in the seed oil of some plant species, where they are mcorporated 
into triacylglycerols (TAGs), as in the Brassicaceae, or into wax esters, as in Jojoba. These 
! !j seed VLCFAs include the agronomically important erucic acid (C22: 1), that may be used « 

the production of lubricants, nylon, cosmetics, pharmaceuticals and piasters. 

VLCFAs are synthesized by a microsomal fatty acid elongation (FAE) system which 
involves four enzymatic reactions: (1) condensation of malonyl-CoA with a long cham acyl- 
CoA (?) reduction to beta-hydroxyacyl-CoA, (3) dehydration to an enoyl-CoA and (4) 
lsI reduction of the enoyl-CoA, resulting in the acyl-CoA elongated by two carbons. The 

, • ~t;™ m i« a kev activity of the FAE system. It is the rate- 
condensing enzyme catalyzing reaction (1) is a key activity 

25 limiting enzvme of the VLCFA biosynthetic pathway, which controls the amount ofVLCFAs 
produced (Miller and Kunst, 1997). In addition, the condensing enzyme determines the 
ultimate VLCFA acyl chain length, and thus their uses in seed oil or wax biosynthesis. 

Different condensing enzymes acting on, and producing, acyl chains of different 
30 length have recently been characterized. Several groups independently identified the first 
plant fatty acid elongation gene in Arabics, FAE1 (James and Dooner, 1990; Kunst et 
al 1992- and Lemieux et at, 1990). FAE1 was subsequently cloned and is described in 
W09613582 as catalyzing the conversion of CIS fatty acids to C20-C22 fatty acids. The 
patent W09613582 suggests that FAE1 will have activity in a very broad host range. In 
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support of this assertion of broad host range activity, it has been shown that FAE1 has the 
same activity in yeast as inArabidopsis (Miller and Kunst, 1997). A Jojoba protein involved 
in the synthesis of VLCFAs has also been isolated having relatively high homology to FAE1 
(52% amino acid identity), and has been shown to have beta-ketoacyl-coenzyme A synthase 
5 (KCS) activity (W09515387). Broad host range activity for genes encoding KCS has been 
further evidenced by the Jojoba KCS cDNA. The Jojoba KCS cDNA was able to 
complement a mutation in a Canola variety of rapeseed (Brassica napus), to restore m the 
variety high levels of VLCFAs (Lassner et al., 1996). An Arabidopsis gene (Ct/77), 
required for cuticular wax biosynthesis and pollen fertility, has also been described as 
10 encoding a VLCFA condensing enzyme that catalyzes the addition of 2C units to pre-existmg 
C24 or longer fatty acids (Millar et al., 1999). 

The broad specificities exhibited by FAE activities provide a means of modifying the 
synthesis of VLCFAs in a given cell. As evidenced by the accumulation of VLCFAs in 
15 tobacco seed expressing FAE1 (Millar and Kunst, 1997), heterologous condensmg enzymes 
may be used to produce VLCFAs in plant species that do not otherwise synthesize VLCFAs. 
For example, targeted expression of heterologous VLCFA condensing enzymes in seeds may 
allow the production of crop plants capable of synthesizing VLCFAs of desired lengths m seed 
oil for industrial applications. New condensing enzymes may also be useful for the 
20 manipulation of cuticular waxes which have important functions in many physiologtcal 
processes in plants, including water balance, protection from UV light, plant-insect 
interactions, and defense against bacterial and fungal pathogens (Post-Beittenmiller, 1996). As 
a result, there is a need for new condensing enzymes that may be used alone or in combmauon 
with other condensing enzymes to confer on plants and plant tissues the ability to synthestze a 
25 range of VLCFAs, including VLCFAs up to C30 in length. 

There is also a need for tissue-specific promoters capable of mediating the expression 
of heterologous condensing enzymes in epidermal, cells, which may facilitate the alteration of 
the wax composition and/or accumulation in plants. This may, in turn, result in the production 
30 of crops with increased tolerance to environmental stresses, and/or resistance to pathogens and 
insects. For example, drought resistance in rice is associated with high wax lines nch n C29, 
C33 and C35 alkanes (OToole and Cruz, 1983; Haque et al., 1992). Increased wax deposmon 
may also be accomplished by overexpression of condensing enzymes with desired acyl chatn 
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length specificities using an q 
etal., 1999). 



pidermis-specific promoter, such as the CUT1 promoter (Millar 



eU are regulated by the natureof condensing enzyme expression in me cell (Mtller a,^d 
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promoter sequence is capable of mediating gene expression in anthers and in very young leaves 
in Arabidopsis. The promoter sequences of the invention may be derived from *n Arabidopsis 
KCS2 promoter sequence, as disclosed herein. Promoter sequences of the invention may also 
hybridize under stringent conditions to the Arabidopsis KCS2 promoter sequence disclosed 
5 herein. Promoter sequences of the invention may also be at least 70% identical to the 
Arabidopsis KCS2 promoter sequence when optimally aligned. 

The invention provides nucleic acid probes comprising probe sequences that hybridize 
under stringent conditions to a ponion of an Arabidopsis KCS2 genomic sequence; or are at 
10 least 70% identical to the portion of an Arabidopsis KCS2 genomic sequence when optimally 
aligned. The invention also provides methods of isolating a nucleic acid molecule encoding a 
plant long chain fatty acid condensing enzyme, for example by hybridizing a nucleic acid 
preparation with the nucleic acid probe of the invention. 

15 The invention provides transgenic cells (such as plant cells), plants and parts thereof, in 

which the plants or cells comprise the recombinant nucleic acid molecules of the invention. 
Such plant parts may for example include seeds or oils. Transgenic plants of the invention may 
have a modified phenotype compared to a non-transgenic plant of the same species, such as a 
modified lipid content. Methods of producing such transgenic plants are provided, for example 

20 by introducing into a plant the isolated nucleic acids of the invention. Progeny plants may be 
provided, produced by sexual or asexual propagation of the transgenic plants of the invention 
to produce transgenic descendants of transformed plants. 

Purified proteins are provided, encoded by the recombinant nucleic acid molecules of 
25 the invention, including plant fatty acid condensing (elongase) enzymes and fragments thereof. 
Also provided are recombinant vectors comprising the recombinant nucleic acid molecules of 
the invention. 

Antisense nucleic acid molecules are provided, wherein a portion of the nucleic acid 
30 coding sequences of the invention are provided in reverse orientation relative to an adjacent 
promoter sequence. Recombinant antisense nucleic acids of the invention may therefore 
encode an antisense RNA that hybridizes under stringent conditions to a complement of a 
portion of the Arabidopsis KCS2 coding sequence; or that are at least 70% identical to a 
portion of the Arabidopsis KCS2 coding sequence when optimally aligned. Transgenic plants 
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or plant cells of the invention may include the recombinant antisense nucleic acids of the 
invention. 

In one embodiment, the nucleic acid coding sequences of the invention may be 
substantially identical to all or part of an Arabidopsis KCS2 coding sequence. The nucleic 
acids of the invention may also include an RNA analog or a nucleic acid complementary to 
sequences of the invention. In other embodiments, the nucleic acid may be a fragment of one 
of the above sequences, such as a fragment that is at least 5, 10, 15, 20 or 25 nucleotides in 
length and that hybridizes under stringent conditions to a genomic KCS2 DNA encoding the 
nucleic acid sequence. As used herein, the term "genomic sequence" includes either of the 
strands of a nucleic acid molecule found in the genome of an organism or cell. 



In another aspect, the nucleic acids of the invention may include a DNA coding 
sequence encoding an enzyme of the invention, operably linked to a promoter. Promoters of 
5 the invention may be tissue-specific or have specific developmental timing. A promoter of the 
invention may have a nucleotide sequence substantially identical to all or part of the KCS2 
promoter region disclosed herein. Promoters of the invention may be operably linked to 
alternative DNAs, such as agronomically important nucleic acid sequences. 

.0 In one aspect, the invention provides methods for altering the VLCFA, fatty acid or 

lipid content in a plant or plant tissue, for example by introducing into a plant cell, capable of 
being transformed and regenerated to a whole plant, a nucleic acid of the invention. Where 
such nucleic acids are effective for altering the levels of VLCFAs in a plant; a plant containing 
the nucleic acid of the invention may be recovered having an altered phenotype. 

25 

THE DRAWINGS 

Figure 1 shows the coding strand DNA sequence of a KCS2 genomic clone from 
Arabidopsis. containing a 1001 bp long KCS2 promoter region (5' to the initiation codon) and a 
1508 bp long KCS2 coding sequence. The initiation (ATG) codon is indicated in all caps (at 
30 positions 1 046- 1 048). The Figure identifies locations homologous or complementary to 

oligonucleotide primers (probes) that may be used for amplifying the wild-type Arabidopsis 
KCS2 promoter (shown as kcs3 and kcs4) and the KCS2 coding sequence (kcsl and kcs2), 
these sequences are underlined and marked by arrows indicating the putative direction for 
amplification. 
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DETAILED DESCRIPTION OF THE INVENTION 

The recombinant nucleic acid molecules of the invention may comprise a heterologous 
nucleic acid coding sequence encoding the plant long chain fatty acid condensing enzyme of 
the invention. In alternative embodiments, the nucleic acid coding sequence may be derived 
from the Arabidopsis KCS2 coding sequence disclosed herein. 

The term "recombinant" means that something has been recombined, so that when 
made in reference to a nucleic acid molecule the term refers to a molecule that is comprised of 
nucleic acid sequences that are joined together by means of molecular biological techniques. 
The term "recombinant" when made in reference to a protein or a polypeptide refers to a 
protein molecule which is expressed using a recombinant nucleic acid molecule. The term 
"heterologous" when made in reference to a nucleic acid sequence refers to a nucleotide 
sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to 
which it is not ligated in nature, or to which it is ligated at a different location in nature. The 
term "heterologous" therefore indicates that the nucleic acid molecule has been manipulated 
using genetic engineering, i.e. by human intervention. 

Sequences may be derived or obtainable from the Arabidopsis KCS2 coding sequence 
by deduction and synthesis based upon the wild-type K.CS2 amino acid sequence, using the 
redundancy of the genetic code to formulate alternative coding sequences. Derived sequences 
may also be identified in different organisms, for example by isolation using as probes the 
nucleic acid sequences of the invention. Antibodies prepared against a KCS2 protein of the 
invention may also be used to identify alternative coding sequences, which are thereby derived 
from the coding sequence disclosed herein. Alternative KCS2 amino acid sequences of the 
invention, such as amino acid sequences developed through mutagenesis or substitution, may 
similarly be used to infer coding sequences derived from the Arabidopsis KCS2 coding 
sequence disclosed herein. 

Derived nucleic acids of the invention may be obtained by chemical synthesis, 
isolation, or cloning from genomic DNAs using techniques known in the art, such as the 
Polymerase Chain Reaction (PCR). Nucleic acids of the present invention may be used to 
design alternative primers (probes) suitable for use as PCR primers to amplify particular 
regions of an condensing enzyme cDN A of the invention. Such PCR primers may for example 
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comprise a sequence of 15-20 consecutive nucleotides of the KCS2 cDNA of the invention. To 
enhance amplification specificity, primers of 20-30 nucleotides in length may also be used. 
Methods and conditions for PCR amplification are described in Innis et al. (1990); Sambrook 
et al. (1989); and Ausubel et al. (1995). 

5 

As used herein, the term "probe" when made in reference to an oligonucleotide refers to 
an oligonucleotide which is capable of hybridizing to another oligonucleotide of interest A 
probe may be single-stranded or double-stranded. Probes are, for example, useful in the 
detection, identification, amplification and isolation of particular gene sequences. 
10 Oligonucleotide probes may be labelled with a "reporter molecule," so that the probe is 

detectable using a detection system, such as enzymatic, fluorescent, radioactive or luminescent 
detection systems. 

Derived nucleic acids of the invention may also be identified by Southern analysis, a 
15 method by which the presence of DNA sequences in a target nucleic acid mixture are identified 
by hybridization to a labeled probe, comprising an oligonucleotide or DNA fragment of a 
nucleic acid of the invention. Probes for Southern analysis may for example be at least 15 
nucleotides in length. Southern analysis typically involves electrophoretic separation of DNA 
digests on agarose gels, denaturation of the DNA after electrophoretic separation, and transfer 
20 of the DNA to nitrocellulose, nylon, or another suitable membrane support for analysis with a 
radiolabeled, biotinylated, or enzyme-labeled probe as described in Sambrook et al (1989). 

In alternative embodiments, the invention includes nucleic acids that encode the plant 
very long chain fatty acid condensing enzyme of the invention, wherein the enzyme has an 
25 amino acid sequence that is at least 70% identical to an Arabidopsis KCS2 amino acid 

sequence disclosed herein, when optimally aligned. In alternative embodiments, the degree of 
identity may be between 50% and 100%, such as 60%, 80%, 90%, 95% or 99%. 



As is well known in the art, some modifications and changes can be made in the 
30 structure of a polypeptide without substantially altering the biological function of that peptide, 
to obtain a biologically equivalent polypeptide. In one aspect of the invention, fatty acid 
condensing enzymes may include peptides that differ from a portion of the wild-type 
Arabidopsis KCS2 sequence by conservative amino acid substitutions. As used herein, the term 
"conserved amino acid substitutions" refers to the substitution of one amino acid for another at 
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a given location in the peptide, where the substitution can be made without substantial loss of 
function. In making such changes, substitutions of like amino acid residues can be made, for 
example, on the basis of relative similarity of side-chain substituents, for example, their size, 
charge, hydrophobicity, hydrophilicity, and the like, and such substitutions may be assayed for 
their effect on the function of the peptide by routine testing. 

In some embodiments, conserved amino acid substitutions may be made where an 
amino acid residue is substituted for another having a similar hydrophilicity value (e.g., within 
a value of plus or minus 2.0), where the following hydrophilicity values are assigned to amino 
acid residues (as detailed in United States Patent No. 4,554,101, incorporated herein by 
reference): Arg (+3.0); Lys (+3.0); Asp (+3.0); Glu (+3.0); Ser (+0.3); Asn (+0.2); Gin (+0.2); 
Gly (0); Pro (-0.5); Thr (-0.4); Ala (-0.5); His (-0.5); Cys (-1 .0); Met (-1.3); Val (-1.5); Leu (- 
1.8); lie (-1.8); Tyr (-2.3); Phe (-2.5); and Trp (-3.4). 

In alternative embodiments, conserved amino acid substitutions may be made where an 
amino acid residue is substituted for another having a similar hydropathic index (e.g., within a 
value of plus or minus 2.0). In such embodiments, each amino acid residue may be assigned a 
hydropathic index on the basis of its hydrophobicity and charge characteristics, as follows: lie 
(+4.5); Val (+4.2); Leu (+3.8); Phe (+2.8); Cys (+2.5); Met (+1 .9); Ala (+1 .8); Gly (-0.4); Thr 
(-0.7); Ser (-0.8); Trp (-0.9); Tyr (-1 .3); Pro (-1.6); His (-3.2); Glu (-3.5); Gin (-3.5); Asp (- 
3.5); Asn (-3.5); Lys (-3.9); and Arg (-4.5). 

In alternative embodiments, conserved amino acid substitutions may be made where an 
amino acid residue is substituted for another in the same class, where the amino acids are 
divided into non-polar, acidic, basic and neutral classes, as follows: non-polar: Ala, Val, Leu, 
He, Phe, Trp, Pro, Met; acidic: Asp, Glu; basic: Lys, Arg, His; neutral: Gly, Ser, Thr, Cys, Asn, 
Gin, Tyr. 

The nucleic acid coding sequences of the invention also include sequences that 
hybridize under stringent conditions to a complement of the Arabidopsis KCS2 coding 
sequence disclosed herein. The nucleic acid coding sequences of the invention may also be at 
least 70% identical to the Arabidopsis KCS2 coding sequence, when optimally aligned. 
Embodiments of the invention include isolated nucleic acid molecules comprising the coding 
sequences of the invention. 
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The present invention provides an isolated nucleic acid encoding a plant elongase 
enzyme that mediates biosynthesis of 18:0, 20:0, 22:0 and 24:0 fatty acids in plants. In a 
preferred embodiment, the nucleic acid encodes KCS2 identified in Arabidopsis shown in 
Figure 1. 

By isolated, it is meant that the isolated substance has been substantially separated or 
purified away from other biological components with which it would otherwise be associated, 
for example in vivo. The term 'isolated' therefore includes substances purified by standard 
purification methods, as well as substances prepared by recombinant expression in a host, as 
well as chemically synthesized substances. 



The nucleic acids of this invention may be genomic or cDNA and may be isolated from 
cDNA or genomic libraries or directly from isolated plant DNA. Nucleic acids of this 

5 invention further include sequences corresponding to the KCS2 protein as well as sequences 
obtainable from the KCS2 protein or nucleic acid sequences. By corresponding is meant 
nucleic acid sequences, either DNA or RNA, including those which encode the KCS2 protein 
or a portion thereof, the promoter sequence found 5' to said encoding sequence, intron 
sequences not present in the cDNA, as well as sequences encoding any leader or signal peptide 

20 of a precursor protein that may not be found in the mature elongase protein. A nucleic acid of 
the invention may further comprise additional nucleic acids. For example, linkers, modified or 
unmodified restriction endonuclease sites and other nucleic acid sequences useful for cloning, 
expression, or purification. 

25 Sequences having substantial identity will hybridize under stringent conditions. 

Hybridization to filter-bound sequences may for example, be performed in 0.5 M NaHP0 4 , 7% 
sodium dodecyl sulfate (SDS), 1 mM EDTA at 65'C, and washing in 0.2 x SSC/0.1% SDS at 
42'C (Ausubel et al, 1995). Alternatively hybridization to filter-bound sequences may for 
example, be performed in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 
30 65'C, and washing in 0.1 x SSC/0.1% SDS at 68'C (Ausubel et al, 1995). Hybridization 

conditions may be modified in accordance with known methods depending on the sequence of 
interest (Tijssen, 1993). Generally, stringent conditions are selected to be about 5'C lower 
than the thermal melting point for the specific sequence at the defined ionic strength and pH. 
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A nucleic acid may also be identified by Northern analysis, a method used to identify 
RNAs that hybridize to a known probe such as an oligonucleotide, DNA fragment, cDNA or 
fragment thereof, or RN A fragment of a nucleic acid of the invention. The probe is labeled 
with a radioisotope such as 32 P, by biotinylation or with an enzyme. The RNA to be analyzed 
5 may be electrophoretically separated on an agarose or polyacrylamide gel, transferred to 
nitrocellulose, nylon, or other suitable membrane, and hybridized with the probe, using 
standard techniques well known in the art such as described in Sambrook et al. (1 989). 

A nucleic acid of the invention may alternatively be obtained by immunological 
10 screening. Antibodies to the KCS2 protein may be used to identify related protein in extracts 
of desired plant species. Two amino acid sequences are considered substantially identical if 
one peptide is specifically immunologically reactive with antibodies that are also specifically 
immunoreactive against the other peptide. Specific immunoreactivity of antibodies to peptides 
may be assessed using a variety of immunoassay formats, such as solid-phase ELISA 
15 immunoassays for selecting monoclonal antibodies specifically immunoreactive with a protein 
(Harlow and Lane, 1988). 

Optimal alignment of sequences for comparisons of identity may be conducted using a 
variety of algorithms, such as the local homology algorithm of Smith and Waterman (1981), 
the homology alignment algorithm of Needleman and Wunsch (1970), the search for similarity 
20 method of Pearson and Lipmari (1988), and the computerized implementations of these 
algorithms (such as GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group, Madison, WI, U.S.A.). Sequence alignment 
may also be carried out using the BLAST algorithm described in Altschul et al. (1990), using 
the published default settings. 

25 Software for performing BLAST analysis may be available through the National Center 

for Biotechnology Information (through the internet at http://www.ncbi.nlm.nih.govA- The 
BLAST algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database sequence. 

30 T is referred to as the neighbourhood word score threshold. Initial neighbourhood word hits act 
as seeds for initiating searches to find longer HSPs. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
Extension of the word hits in each direction is halted when the following parameters are met: 

- 10- 
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the cumulative alignment score Mis off by the quantity X from its maximum achieved value; 
the cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algonthm 
parameters W, T and X determine the sensitivity and speed of the alignment. The BLAST 
5 programs may use as defaults a word length (W) of 1 1, the BLOSUM62 scoring matnx 

(Henikoff and Henikoff, 1992, Proc. Natl. Acad Sci. USA 89: 10915-10919) alignments (B) of 
50 expectation (E) of 10 (which may be changed in alternative embodiments to 1 or 0.1 or 
0 01 or 0 001 or 0.0001; although E values much higher than 0.1 may not identify functionally 
similar sequences, it is useful to examine hits with lower significance, E values between 0.1 
10 and 10, for short regions of simi^ 

strands. For protein compansons, BLASTP may be used with defaults as follows: G=l 1 (cost 
to open a gap); E-l (cost to extend a gap); E=10 (expectation value, at this setting, 10 hits with 
scores equal to or better than the defined alignment score, S, are expected to occur by chance 
in a database of the same size as the one being searched; the E value can be increased or 
15 decreased to alter the stringency of the search.); andW=3 (word size, defaultis 11 for 
BLASTN, 3 for other blast programs). 

The BLOSUM matrix assigns a probability score for each position in an alignment that 
is based on the frequency with which that substitution is known to occur among consensus 
20 blocks within related proteins. The BLOSUM62 (gap existence cost = 1 1 ; per residue gap cost 
= 1- lambda ratio = 0.85) substitution matrix is used by default in BLAST 2.0. A variety of 
other matrices may be used as alternatives to BLOSUM62, including: PAM30 (settings: 
9 1 0.87); PAM70 (settings: 10,1,0.87) BLOSUM80 (settings: 10,1,0.87); BLOSUM62 
(settings- 1 1 1,0.82) and BLOSUM45 (settings: 14,2,0.87). One measure of the statistical 
25 similarity between two sequences using the BLAST algorithm is the smallest sum probability 
(P(N)) which provides an indication of the probability by which a match between two 
nucleotide or amino acid sequences would occur by chance. In alternative embodiments of the 
invention, nucleotide or amino acid sequences may be identified as sequences of the invention 
where the smallest sum probability in a comparison to a reference KCS2 amino acid or KCS2 
30 nucleic acid test sequence is less than about 1, preferably less than about 0.1, more preferably 
less than about 0.01 , and most preferably less than about 0.001 . 

When a position in the compared sequence is occupied by the same nucleotide or amino 
acid, following optimal alignment of the sequences, the molecules are considered to have 



11 - 



WO 01/07586 



PCT/CAOO/00860 



identity at that position. The degree of identity between sequences is a function of the number 
of matching positions shared by the sequences. In terms of percentage, identity is the sum of 
identical positions, divided by the total length over which the sequences are aligned, multiplied 
by 100. 

It will be recognized by one of ordinary skill in the art that nucleic acids of this 
invention may be modified using standard techniques of site specific mutation or PCR, or 
modification of the sequence may be accomplished in producing a synthetic nucleic acid 
sequence. Such modified sequences are also considered in this invention. For example, due to 
the degeneracy of the genetic code, which is well-known to the art; i.e., for many amino acids, 
there is more than one nucleotide triplet which serve as the codon for the amino acid, codons 
may be changed such that the nucleic acid sequence encodes the same amino acid sequence, or 
alternatively, codons may be altered such that conservative amino acid substitutions or 
substitutions of similar amino acids result without affecting protein function. 

In another aspect of the invention, the nucleic acid may further comprise a promoter 
operably linked to the enzyme-encoding region. A first nucleic acid sequence is operably 
linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a 
functional relationship with the second nucleic acid sequence. For instance, a promoter is 
0 operably linked to a coding sequence if the promoter affects the transcription or expression of 
the coding sequences. Generally, operably linked DNA sequences are contiguous and, where 
necessary to join two protein coding regions, in reading frame. 

A promoter, as used herein, is a DNA sequence in a gene, usually (but not necessarily) 
25 upstream (5') to its coding sequence, which controls the expression of the coding sequence by 
providing the recognition for RNA polymerase and other factors required for proper 
transcription. In artificial DNA constructs promoters may also be used to transcribe antisense 
RNA. Promoters may also contain DNA sequences that are involved in the binding of protein 
factors which control the effectiveness of transcription initiation in response to physiological or 
30 developmental conditions. A promoter may also contain enhancer elements, a DNA sequence 
which can stimulate promoter activity. A gene of the present invention may also include 
transcription termination signals. 
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In one embodiment of the invention, the nucleic acid may be used to alter the 
composition and accumulation of fatty acids in plant cells. For example, for over-expression, 
which refers to production of a gene product exceeding levels of production in normal or non- 
transformed organisms, a plant promoter may be employed which will direct expression of the 

5 elongase in all tissues of a regenerated plant. Over-expression of the gene, for example, in 
plant epidermal cells could increase cuticle accumulation thereby increasing drought and stress 
tolerance of transgenic plants over control plants. Such promoters are referred to herein as 
"constitutive" promoters and are active under most environmental conditions and states of 
development or cell differentiation. Examples of constitutive promoters include the 

10 cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1 or 2'- promoter 
derived from T-DNA of Agrobacterium tumefaciens, and other transcription initiation regions 
from various plant genes known to those of skill in the art. 

In various embodiments, the invention comprises plants transformed with the nucleic 
15 acids of the invention. In some embodiments, such plants will exhibit altered fatty acid content 
in one or more tissues. These aspects of the invention relate to all higher plants, including 
monocots and dicots, such as species from the genera Fragaria. Lotus, Medicago, Onobrychis, 
Triforium, Trigonelia, Wgna, Citrus, Linum. Geranium, Manihot, Caucus, Arabidopsis, 
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Hyoscyamus, Lycopersicon, Nicotiana, 
20 Solarium, Petunia, Digitalis, Majorana. Cichorium, Helianthus, Lactuca, Bromus, Asparagus, 
Antirrhinum, Heterocatlis, Nemesia, Pelargonium, Panicum, Penniserum, Ranunculus, 
Senecio, Salpiglossis, Cucarnis, Browallia, Glycine, Lolium, lea, Triticum, Sorghum, and 
Datura. Such plants may include maize, wheat, rice, barley, soybean, beans, rapeseed, canola, 
alfalfa, flax, sunflower, cotton, clover, lettuce, tomato cucurbits, potato carrot, radish, pea 
25 lentils, cabbage, broccoli, brussel sprouts, peppers, apple, pear, peach, apricot, carnations and 
roses. More specifically, in alternative embodiments, plants for which the invention may be 
used in modifying fatty acid content include oil crops of the Cruciferae family: canola, 
rapeseed {Brassica spp.), crambe {Crambe spp.), honesty {Lunaria spp.) lesquereUa 
{Lesquerela spp.), and others; the Composirae family: sunflower {Helianthus spp.), safflower 
30 {Carthamus spp.), niger {Guizotia spp.) and others; the Palmae family: palm {Elaeis spp.), 
coconut {Cocos spp.) and others; the Leguminosae family: peanut {Arachis spp.), soybean 
{Glycine spp.) and others; and plants of other families such as maize {Zea spp.), cotton 
{Gossvpiun sp.), jojoba {Simonasia sp.), flax {Linum sp.), sesame {Sesamum spp.), castor bean 
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(Ricinus spp.), olive (Oka spp.), poppy (Papaver spp.), spurge (Euphorbia, spp.), 
meadowfoam (Limnanthes spp.), mustard (Sinapis spp.) and cuphea (Cuphea spp.). 

The promoter of the invention may direct expression of the elongase gene in a specific 
tissue or may be otherwise under more precise environmental or developmental control. Such 
promoters are referred to here as "inducible" promoters. Examples of environmental 
conditions that may effect transcription by inducible promoters include anaerobic conditions, 
elevated temperature, or the presence of light. Examples of promoters under developmental 
control include promoters that initiate transcription only in certain tissues, such as fruit, seeds, 
or flowers. 

In other embodiments, a nucleic acid disclosed herein may be used for cosuppression or 
antisense inhibition. Antisense inhibition refers to the production of antisense RN A transcripts 
capable of preventing or reducing the expression of the target protein. The term "antisense" as 
used herein refers to a nucleotide sequence in which the sequence of residues is in reverse 5' to 
3' orientation in relation to the sequence of residues in a sense strand of a nucleic acid coding 
sequence. A "sense" or "coding" strand or sequence refers to a sequence that may be 
transcribed by a cell into an mRNA encoding a protein. An "antisense" sequence will therefore 
generally have the same sequence as the non-coding strand in a DN A duplex, and an antisense 
RNA will be complementary to an "antisense" sequence. Co-suppression refers to the 
phenomenon in which expression of a foreign gene which has substantial homology to an 
endogenous gene results in the suppression of expression of both the foreign and the 
endogenous gene. 

As used herein, the terms "complementary" or "complementarity" when used in 
reference to polynucleotides refer to polynucleotides which are related by the base-pairing 
rules, including the rules governing complementarity between DNA and RNA. For example, 
the sequence 5'-AGT-3' is complementary to the sequence 5'-ACT-3'. 

In another embodiment of the invention, the KCS2 promoter disclosed herein may be 
operably linked to agronomically important genes, other than the elongase gene disclosed in 
the present invention. For example, the KCS2 promoter may be operably linked to the Bt gene, 
encoding a protein insecticide, to confer insect resistance on the transformed plant (US Patent 
No. 5,723,756). The KCS2 promoter may be operably linked to genes encoding proteins, 
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which have fungal, bacterial and viral inhibiting effects, such as lysozyme, chitinase, attacin, 
and cecropin (US Patent No. 5,597,946) to confer fungal/bacterial resistance on transformed 
plants. 

A cell, tissue, organ, or organism into which has been introduced a foreign nucleic acid, 
is considered "transformed", "transfected", or "transgenic". A transgenic or transformed cell 
or organism also includes progeny of the cell or organism and progeny produced from a 
breeding program employing a transgenic plant as a parent in a cross and exhibiting an altered 
phenotype resulting from the presence of a recombinant nucleic acid construct. A transgenic 
plant is therefore a plant that has been transformed with a heterologous nucleic acid, or the 
progeny of such a plant that includes the transgene. The invention provides vectors, such as 
vectors for transforming plants or plant cells. The term "vector" in reference to nucleic acid 
molecule generally refers to a molecule that may be used to transfer a nucleic acid segments) 
from one cell to another. 

Another aspect of the invention provides transgenic plant cells, plant tissues derived 
from such plant cells, and descendants thereof. Nucleic acids of the invention may be 
introduced into the genome of the desired plant host by a variety of conventional techniques 
which include, without limitation, electroporation and microinjection of plant cell prototplasts 
and polyethylene glycol precipitation (such as are disclosed in Paszkowski et al. (1984); 
Fromm et al., (1985); Rogers et al, (1986); and in U.S Patent Nos. 4,684,61 1; 4,801,540; 
4,743.548 and 5,231,019), ballistic methods such as DNA particle bombardment (for example 
as disclosed in Klein, etaL (1987); Gordon-Kamm, et al. (1990); and in U.S. Patent Nos. 
4,945,050; 5,015,580; 5,149,655 and 5,466,587); Agrobacterium-mediaied transformation 
methods (such as those disclosed in Horsch et al. Science (1984); Fraley et al. (1983); and 
U.S. Patent Nos. 4,940,838 and 5,464,763). Alternative transformation protocols are disclosed 
for example in U.S. Pat. No. 5,584,807; 5,501,967; Fraley et al. (1982) Proc. Natl Acad Sci. 
USA 79:1859-1863; Krens et al. (1982) Nature 296:72-74). 

Transformed plant cells, which may be derived by any of the above transformation 
techniques, may be cultured to regenerate whole plants having the transformed genotype and 
displaying a desired phenotype, as for altered VLCFA levels. A variety of plant culture 
techniques may be used to regenerate whole plants, such as described in Gamborg and Phillips 
(1995); Evans et al (1983); or Binding, (1985); or in Klee (1987). 
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One of skill will recognize that after the nucleic acid is stably incorporated in 
transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual 
crossing. Any of a number of standard breeding techniques may be used, depending upon the 

species to be crossed. 

Nucleic acids of the invention may also be used as a plant breeding tool, as molecular 
markers to aid in plant breeding programs. Such techniques would include using the gene 
itself as a molecular probe or using the DNA sequence to design PCR primers to use PCR 
based screening techniques in plant breeding programs. 

The invention now being generally described, it will be more readily understood by 
references to the following examples, which are included for purposes of illustration only and 
are not intended to limit the invention unless so stated. 



3 



Example 1 
Honing KCS2 

Following discovery and sequencing of the region of KCS2 in Arabidopsis. synthetic 
oligonucleotides homologous to portions of the KCS2 genomic sequence were prepared and 
used as primers to amplify either 1508 bp or 1001 bp of continuous KCS2 DNA sequence by 
PCR As shown in Figure 1, the upstream primer was 5'-GTATCATCAACAAAAATATC-3' 
(kcsl) in combination with the downstream primer 5'-CAAAGATCGATCTTAACC-3' (kcs2) 
for the PCR-synthesis of the 1508 bp genomic DNA fragment, which includes a complete 
coding sequence. Primers 5'-CGATCACGGAGTAGAGAA-3' (kcs3) and 5'- 
GG AC AGTTTCT AAAGC AG-3 ' (kcs4) were used for the PCR-amplification of the 1001 bp 
of the 5' untranslated region (promoter fragment). The amplified 1508 bp fragment was 
subcloned in the EcoRI site of the plasmid pCR2.1 (Invitrogen) to produce plasmid pCR- 
KCS2 whereas the 1001 bp promoter fragment was subcloned in the Smal site of the plasmid 
pGEM7zf (Promega) to produce plasmid P GEM-proKCS2. The cloned DNA fragments were 
then completely sequenced on both strands using an automatic sequencer by the dideoxy chain 
termination method. These two fragments together comprise 2508 bp of continuous genomic 
DNA sequence shown in Figure 1. This DNA sequence represents a 1001 bp 5" untranslated 
■ region (i.e., nucleotides preceding the first ATG codon), and a 1461 bp open reading frame that 
encodes a 487 amino acid protein with a predicted molecular weight of 54,900 Da (KCS2). 
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Example 2 

Ex pression of g^S? »" Yeast 

The full length KCS2 cDNA was expressed in Saccharomyces cerevisiae linked to the 
GAL10 inducible promoter. The KCS2 gene was cleaved from P CR2.1 with EcoRI and 
inserted into the EcoRI site of the polylinker of the S. cerevisiae expression vector pESC-TRP 
(Stratagene), resulting in the plasmid P ESC-KCS2. The construct was transformed into the S. 
cerevisiae strain YPHY99 by standard procedures (Schiestl and Gietz, 1989) and transfonnants 
selected on minimal medium agar plates lacking tryptophan (Ausubel et al., 1995). 



10 



When yeast cells harboring the pESC-/CC52 construct were grown in the presence of 
galactose, 18:0, 20:0, 22:0 and 24:0 fatty acids accumulated (which are not normally present in 
the YPHY99 yeast stain, and were not detected in cultures of yeast transfonned with the pESC 
plasmid alone). Previous work on FAE1 indicates that the yeast expression system is a 
15 predictive model of elongase conder^mg er,zyme activity in plants (Millar and Kunst, 1997). 
Accordingly, the yeast expression results disclosed herein shows that the KCS2 gene encodes 
an elongase (condensing) enzyme capable of mediating biosynthesis of saturated VLCFAs, 
including 18:0, 20:0, 22:0 and 24:0 fatty acids, in a wide variety of organisms. 

20 Example 3 

Artivitv of the T i^iiP-Srecific Promoter 

In order to confirm and more precisely delineate KCS2 expression patterns, 5' flanking 
sequences from the KCS2 genomic clone were translationally fused (operatively linked) to the 
uidA reporter gene encoding beta-glucuronidase (GUS). The inserts were cleaved out of 
25 pGEM7zf with BamHI and Xbal and directionally subcloned into the conespondmg sites of 
the binary Ti plasmid pBHOl (Clontech), which contains a promoterless GUS gene (Jefferson 
et al 1987) The pKCS2-GUS fusion construct in pBHOl was introduced into Agrobacterium 
tumefaciens strain GV3101 (Koncz and SchelL 1986) by heat-shock and selected for resistance 
to kanamycin (50 (g/ml). 

Thep£CS2-GUS fusion was introduced into Arabidopsis using the floral dip method 
(Clough and Bent, 1998). Screening for transformed seed was done on 50 micrograms/mL 
kanamycin as described previously (Katavic et al., 1994). The KCS2 promoter expression 
partem was determined using GUS histochemical assays using buffer containing X-GLUC on 



30 



17- 



WO 01/07586 



PCT/CA00/00860 



whole seedlings, leaves at different stages of development, stems, flowers, siliques and roots 
on a large number of pKCS2-GUS transgenic lines. Wild type plants treated with GUS buffer 
under the same conditions were used as controls. 

5 The expression pattern observed for the KCS2 promoter in the pKCS2-GUS fusion was 

consistent in all transgenic lines tested. GUS activity was only observed in the anthers and very 
young emerging leaves (1-2 mm in size). No GUS expression was detected in any of the other 
tissues or the wild type controls. Expression of KCS2 in the anthers was confirmed by RNA 
blot analysis, using the full KCS2 ORF (Open Reading Frame) as a probe, confinning that the 

10 Arabidopsis KCS2 promoter mediates the expression of heterologous proteins in a tissue 
specific manner. 



Example 4 

^oiycie n f th P S pecificity of *g KP.S2 Gene Product 
15 Expression of the KCS2 gene under the control of GAL10, a galactose-inducible 

promoter in yeast cells expressing pESC-KCS2 results in the appearance of four extra peaks in 
GC chromatograms. These peaks correspond to the saturated VLCFAs 20:0, 22:0, 24:0 and 
26:0, suggesting KCS2 condensing enzyme can elongate fatty acids from C18 to C24 in length. 



20 



25 



To test the ability of KCS2 to elongate an acyl chain inplanta, the full KCS2 coding 
sequence was placed behind the FAE1 seed specific promoter. The recipient plant used for the 
experiment was the mutant CB25, which contains a lesion in the FAE1 gene, resulting in a 
truncated FAE1 protein. Thus, CB25 plants do not make VLCFAs in the seeds, and all the 
VLCFAs observed would be the product of the KCS2 condensing enzyme. 



comp 



Gas chromatography analysis was performed on seeds of 50 transgenic lines and 
spared to CB25 seeds. A number of lines showed a fatty acid profile different from the one 
observed for CB25 seeds. The most dramatic difference was observed in the levels of mono- 
unsaturated C20:l fatty acid, which in some lines shows an increase of almost 100-fold when 
30 compared to the control, followed by C22:l fatty acid and C20:0 fatty acid, as shown in Table 
1. 
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Table 1. 

Fatty Acid Profile of KCS2 Expression 




The identity of the peaks was estimated by comparing their retention times to known 
fatty acid standards. 



10 



15 



20 



Example 5 

Utility of for Seed O il Modification 

Results of the experiments described above show that the KCS2 gene is primarily 
expressed in flower buds and demonstrate that KCS2 condensing enzyme can elongate 
VLCFAs inArabidopsis seeds. In an attempt to co-suppress KCS2 in order to obtain a loss-of- 
function phenotype, as well as a gain-of-function, over-expression phenotype, the strong 
CaMV 35S promoter was used to constitutively express KCS2 in all plant tissues. 

RNA blot analyses carried out on randomly selected transgenic lines to assess the level 
of expression of the ISS-KCS2 transgene demonstrated high levels of expression of the KCS2 
gene in a number of lines. To determine whether high levels of the KCS2 expression translated 
into higher levels of VLCFAs in the seed, seed of the 35S-KCS2 plants showing the highest 
levels of the KCS2 transcript were subjected to gas chromatography analyses. 

Fatty acid profiles of all the transgenic lines tested differ significandy from the wild 



25 type as shown in Figure 2. The main 



difference was observed in the level of 20:1 fatty acid 



(approximately 18% versus 12% in wild type), and a reduced 18:1 content. Apparendy, the 
expression ofiCCS2 in seeds increases the rate of conversion of 18:1 to 20:1 fatty acids and as 
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a result all the transgenic 35S-KCS2 lines accumulate significantly more total VLCFAs than 
the wild type seeds. A calculation of the proportion of VLCFAs with respect to total fatty 
acids showed that all the transgenic lines tested have over 20% of VLCFAs, whereas in the 
wild type VLCFAs account for 16.9% of total fatty acids. 
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! '* CONCLUSION 

Although various embodiments of the invention are disclosed herein, many adaptations 

25 and modifications may be made within the scope of the invention in accordance with the 
common general knowledge of those skilled in this art. Such modifications include the 
substitution of known equivalents for any aspect of the invention in order to achieve the same 
result in substantially the same way. Numeric ranges are inclusive of the numbers defining the 
range. In the specification, the word "comprising" is used as an open-ended term, substantially 

30 equivalent to the phrase "including, but not limited to", and the word "comprises" has a 
corresponding meaning. Citation of references herein shall not be construed as an admission 
that such references are prior art to the present invention. All publications, including but not 
limited to patents and patent applications, cited in this specification are incorporated herein by 
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reference as if each individual publication were specifically and individually indicated to be 
incorporated by reference herein and as though fully set forth herein. 
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